AITopics

doi: 10.1016/j.inffus.2025.103635

2510.23656

Country: North America > Canada > Quebec (0.28)

Genre: Research Report > New Finding (0.67)

Industry:

Transportation > Infrastructure & Services (0.88)
Transportation > Ground > Road (0.66)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial Intelligence, Feb-6-2025

Quantifying Correlations of Machine Learning Models

Li, Yuanyuan, Sarna, Neeraj, Lin, Yang

Machine learning models are used extensively in safety-critical applications, where their errors could cause harm to the user. Such risks are amplified when multiple machine learning models, deployed concurrently, interact and make errors simultaneously. This paper explores three scenarios where error correlations between multiple models arise, resulting in such aggregated risks. Using real-world data, we simulate these scenarios and quantify the correlations in errors of different models. Our findings indicate that aggregated risks are substantial, particularly when models share similar algorithms, training datasets, or foundational models. Overall, we observe that correlations across models are pervasive and likely to intensify with increased reliance on foundational models and widely used public datasets, highlighting the need for effective mitigation strategies to address these challenges.
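As a rough illustration of what "quantifying the correlations in errors" can mean, the sketch below computes the Pearson correlation between the 0/1 error indicators of two classifiers on the same labelled sample. This is only one possible measure and is not taken from the paper; the function names and the toy predictions are hypothetical.

```python
from math import sqrt


def error_indicators(predictions, labels):
    """Return 1 where a prediction is wrong and 0 where it is right."""
    return [int(p != y) for p, y in zip(predictions, labels)]


def pearson(xs, ys):
    """Plain Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        # A model that is always right (or always wrong) has no variance.
        raise ValueError("correlation is undefined for a constant error indicator")
    return cov / sqrt(var_x * var_y)


def error_correlation(preds_a, preds_b, labels):
    """Correlation between the error events of two models on shared data."""
    return pearson(error_indicators(preds_a, labels),
                   error_indicators(preds_b, labels))


# Hypothetical toy data: both models fail on item 2, then fail separately once.
labels = [1, 0, 1, 1, 0, 0, 1, 0]
model_a = [1, 0, 0, 1, 0, 1, 1, 0]  # wrong on items 2 and 5
model_b = [1, 0, 0, 1, 0, 0, 1, 1]  # wrong on items 2 and 7

r_ab = error_correlation(model_a, model_b, labels)
print(round(r_ab, 3))
```

A value near zero would suggest the two models tend to fail on different inputs, while a value near one would suggest they fail together, which is the aggregated-risk situation the abstract describes.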

correlation, large language model, machine learning, (19 more...)

2502.03937

Country:

North America > United States > California (0.05)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
North America > United States > Connecticut (0.04)
(2 more...)

Genre: Research Report > New Finding (0.49)

Industry:

Health & Medicine (0.93)
Energy (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.48)

arXiv.org Artificial Intelligence, Feb-6-2025

Multi-Agent Reinforcement Learning with Focal Diversity Optimization

Tekin, Selim Furkan, Ilhan, Fatih, Huang, Tiansheng, Hu, Sihao, Yahn, Zachary, Liu, Ling

The advancement of Large Language Models (LLMs) and their finetuning strategies has triggered renewed interest in multi-agent reinforcement learning. In this paper, we introduce a focal diversity-optimized multi-agent reinforcement learning approach, coined MARL-Focal, with three unique characteristics. First, we develop an agent-fusion framework for encouraging multiple LLM-based agents to collaborate in producing the final inference output for each LLM query. Second, we develop a focal-diversity optimized agent selection algorithm that can choose a small subset of the available agents based on how well they can complement one another to generate the query output. Finally, we design a conflict-resolution method to detect output inconsistency among multiple agents and produce our MARL-Focal output through reward-aware and policy-adaptive inference fusion. Extensive evaluations on five benchmarks show that MARL-Focal is cost-efficient and adversarial-robust. Our multi-agent fusion model achieves a performance improvement of 5.51% compared to the best individual LLM agent and offers stronger robustness on the TruthfulQA benchmark. Code is available at https://github.com/sftekin/rl-focal

large language model, machine learning, reinforcement learning, (17 more...)

2502.04492

Country: North America > United States > Florida > Miami-Dade County > Miami (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

arXiv.org Artificial Intelligence, Dec-8-2023

The logic of NTQR evaluations of noisy AI agents: Complete postulates and logically consistent error correlations

Corrada-Emmanuel, Andrés

In his "ship of state" allegory (Republic, Book VI, 488), Plato poses a question: how can a crew of sailors presumed to know little about the art of navigation recognize the true pilot among them? The allegory argues that a simple majority voting procedure cannot safely determine who is most qualified to pilot a ship when the voting members are ignorant or biased. We formalize Plato's concerns by considering the AI safety problem of monitoring noisy AI agents in unsupervised settings. An algorithm evaluating AI agents using unlabeled data would be subject to the evaluation dilemma: how would we know that the evaluation algorithm itself was correct? This endless validation chain can be avoided by considering purely algebraic functions of the observed responses. We can construct complete postulates that can prove or disprove the logical consistency of any grading algorithm. A complete set of postulates exists whenever we are evaluating $N$ experts that took $T$ tests with $Q$ questions with $R$ responses each. We discuss evaluating binary classifiers that have taken a single test, the $(N,T=1,Q,R=2)$ tests. We show how some of the postulates have been previously identified in the ML literature but not recognized as such: the agreement equations of Platanios. The complete postulates for pair-correlated binary classifiers are considered, and we show how they allow error correlations to be quickly calculated. An algebraic evaluator based on the assumption that the ensemble is error independent is compared with grading by majority voting on evaluations using the UCI Adult and two-norm datasets. Throughout, we demonstrate how the formalism of logical consistency via algebraic postulates of evaluation can help increase the safety of machines using AI algorithms.
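The majority-voting baseline mentioned above can be sketched in a few lines. This is a minimal illustration, assuming binary (0/1) responses and an odd number of classifiers; it shows only the baseline, not the paper's algebraic evaluator, and the function names and toy votes are hypothetical.

```python
def majority_vote(votes):
    """Per-item majority label from a list of per-classifier 0/1 responses."""
    n_classifiers = len(votes)
    n_items = len(votes[0])
    return [int(2 * sum(v[i] for v in votes) > n_classifiers)
            for i in range(n_items)]


def accuracy(responses, reference):
    """Fraction of items on which responses agree with a reference labelling."""
    return sum(int(r == t) for r, t in zip(responses, reference)) / len(reference)


def grade_by_majority(votes):
    """Grade each classifier against the crowd's own majority answer."""
    consensus = majority_vote(votes)
    return [accuracy(v, consensus) for v in votes]


# Hypothetical responses of three classifiers on five unlabeled items.
votes = [
    [1, 0, 1, 1, 0],
    [1, 0, 0, 1, 0],
    [1, 1, 1, 1, 0],
]
grades = grade_by_majority(votes)

# Hypothetical ground truth, normally unavailable, to expose the gap.
truth = [1, 1, 0, 1, 0]
true_accuracies = [accuracy(v, truth) for v in votes]

print(grades)
print(true_accuracies)
```

In this toy case the first classifier is graded as perfect because it always sides with the majority, even though it is wrong on two of the five items. That gap is the kind of failure the allegory points to when the voters share errors.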

binary classifier, classifier, evaluation, (16 more...)

2312.05392

Country: North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.80)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)

Corrada-Emmanuel, Andrés, Pantridge, Edward, Zahrebelski, Eddie, Chaganti, Aditya, Simeonov, Simeon

Independence Tests Without Ground Truth for Noisy Learners

arXiv.org Machine Learning, Oct-28-2020

Exact ground truth invariant polynomial systems can be written for arbitrarily correlated binary classifiers. Their solutions give estimates for sample statistics that require knowledge of the ground truth of the correct labels in the sample. Of these polynomial systems, only a few have been solved in closed form. Here we discuss the exact solution for independent binary classifiers - resolving an outstanding problem that has been presented at this conference and others. Its practical applicability is hampered by its sole remaining assumption - the classifiers need to be independent in their sample errors. We discuss how to use the closed form solution to create a self-consistent test that can validate the independence assumption itself absent the correct labels ground truth. It can be cast as an algebraic geometry conjecture for binary classifiers that remains unsolved. A similar conjecture for the ground truth invariant algebraic system for scalar regressors is solvable, and we present the solution here. We also discuss experiments on the Penn ML Benchmark classification tasks that provide further evidence that the conjecture may be true for the polynomial system of binary classifiers.

artificial intelligence, classifier, machine learning, (16 more...)

2010.15662

Country: Europe > Finland > Uusimaa > Helsinki (0.04)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Liao, Yuansong, Moody, John E.

Constructing Heterogeneous Committees Using Input Feature Grouping: Application to Economic Forecasting

Neural Information Processing Systems, Dec-31-2000

Yuansong Liao and John Moody, Department of Computer Science, Oregon Graduate Institute, P.O. Box 91000, Portland, OR 97291-1000

Abstract: The committee approach has been proposed for reducing model uncertainty and improving generalization performance. The advantage of committees depends on (1) the performance of individual members and (2) the correlational structure of errors between members. This paper presents an input grouping technique for designing a heterogeneous committee. With this technique, all input variables are first grouped based on their mutual information. Statistically similar variables are assigned to the same group.
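To make the grouping step concrete, here is a minimal sketch that estimates mutual information between discrete variables from counts and then groups variables greedily. The greedy rule, the threshold, and the variable names are assumptions chosen for illustration; the abstract does not specify the paper's actual grouping procedure.

```python
from collections import Counter
from math import log


def mutual_information(xs, ys):
    """Plug-in mutual information (in nats) of two discrete sequences."""
    n = len(xs)
    count_x = Counter(xs)
    count_y = Counter(ys)
    count_xy = Counter(zip(xs, ys))
    return sum((c / n) * log(c * n / (count_x[x] * count_y[y]))
               for (x, y), c in count_xy.items())


def group_by_mutual_information(columns, threshold):
    """Greedy one-pass grouping (an assumed rule, not the paper's).

    A variable joins the first existing group containing a member whose
    mutual information with it reaches the threshold; otherwise it starts
    a new group. Groups are never merged afterwards.
    """
    groups = []
    for name, values in columns.items():
        for group in groups:
            if any(mutual_information(values, columns[m]) >= threshold
                   for m in group):
                group.append(name)
                break
        else:
            groups.append([name])
    return groups


# Hypothetical discretized inputs: rate_b duplicates rate_a, jobs is unrelated.
columns = {
    "rate_a": [0, 0, 1, 1, 0, 1, 0, 1],
    "rate_b": [0, 0, 1, 1, 0, 1, 0, 1],
    "jobs": [0, 1, 0, 1, 0, 0, 1, 1],
}
groups = group_by_mutual_information(columns, threshold=0.5)
print(groups)  # [['rate_a', 'rate_b'], ['jobs']]
```

Continuous inputs would need to be discretized first, or handled with a different mutual information estimator.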

committee member, information, input feature, (14 more...)

Country:

North America > United States > Oregon > Multnomah County > Portland (0.24)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Europe > Netherlands (0.04)

Genre: Research Report (0.69)

Industry: Banking & Finance > Economy (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Forecasting (0.42)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.30)

Parmanto, Bambang, Munro, Paul W., Doyle, Howard R.

Improving Committee Diagnosis with Resampling Techniques

Neural Information Processing Systems, Dec-31-1996

Central to the performance improvement of a committee relative to individual networks is the error correlation between networks in the committee. We investigated methods of achieving error independence between the networks by training the networks with different resampling sets from the original training set. The methods were tested on the sinwave artificial task and the real-world problems of hepatoma (liver cancer) and breast cancer diagnoses.
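One common way to give each committee member a different resampling set is bootstrap sampling, that is, drawing training indices with replacement. The sketch below assumes that scheme for illustration; the abstract does not say which resampling methods the paper compares, and the function names are hypothetical.

```python
import random


def bootstrap_indices(n_samples, n_members, seed=0):
    """Draw one with-replacement index set per committee member."""
    rng = random.Random(seed)  # seeded so the resampling is reproducible
    return [[rng.randrange(n_samples) for _ in range(n_samples)]
            for _ in range(n_members)]


def out_of_bag(indices, n_samples):
    """Indices of training items a member never saw in its resample."""
    seen = set(indices)
    return [i for i in range(n_samples) if i not in seen]


# Hypothetical setup: 10 training items, a committee of 3 networks.
resamples = bootstrap_indices(n_samples=10, n_members=3, seed=1)
for member, idx in enumerate(resamples):
    print(member, sorted(idx), out_of_bag(idx, 10))
```

Each member would then be trained only on its own index set. Because the sets differ, the members see different data, which is the mechanism the abstract relies on to reduce error correlation.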

error correlation, fraction, replicate, (14 more...)

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.05)
North America > United States > Wisconsin (0.04)
North America > United States > Maryland > Montgomery County > Bethesda (0.04)
North America > United States > California > San Diego County > San Diego (0.04)

Industry: Health & Medicine > Therapeutic Area > Oncology (0.89)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.70)
